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DETAILED ACTION 



Response to Amendment 



1 . In response to the Office Action of November 14, 2003, Applicants have 
submitted an Amendment, filed February 23, 2004, amending claims 1-3, 5-9, and 12- 
19, and arguing to overcome the art rejections. Claims 1-19 are pending in this 
application. Of the pending claims, claims 1 and 16 are independent claims. 



2. Applicant's arguments with respect to claims 1-19 have been considered but are 
moot in view of the new ground(s) of rejection. 

3. This is a Non-final Office Action in view of new ground(s) of rejection not 
necessitated by Applicant's amendment. 

Drawings 

4. Figures 1a, 1b should be designated by a legend such as -Prior Art- because 
only that which is old is illustrated. See MPEP § 608.02(g). A proposed drawing 
correction or corrected drawings are required in reply to the Office action to avoid 
abandonment of the application. The objection to the drawings will not be held in 
abeyance. 



Response to Arguments 



Claim Rejections - 35 USC § 103 
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Art Unit: 2655 

5. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



6. Claims 1-4, 13, 15-17, and 19 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Applicant's admitted prior art in view of Pastor et al. (U.S. Patent 6,438,513, hereinafter "Pastor[1]"), and 
further in view of Yamaguchi et al. (U.S. Patent 6,026,359, hereinafter "Yamaguchi"). 


Claim(s) 
1 


Applicant's admitted prior art shows: 

A method of voice recognition in a noise-ridden acoustic signal comprising: 

a step of digitizing and subdividing the noise-ridden acoustic signal into a sequence 
of temporal frames; (Fig. 1b: 1) 

a step of parameterizing speech-containing temporal frames so as to obtain a vector 
of parameters, per speech containing frame, in the frequency domain, the vector of 
parameters expressing the acoustic contents of each speech containing frame; (Fig. 1b: 2) 

a shape-recognition step in which the vectors of parameters are assessed with 
respect to references pre-recorded in a reference space during a preliminary learning step, so 
as to obtain recognition by the determining of at least one reference which is closest to the 
vector of parameters; (Fig. 1b: 3) 

Applicant's admitted prior art does not show: 

a step of reiterative searching for successive noise models in the sequence of 
temporal frames, a new noise model replacing a current noise model, each noise model 
comprising several successive frames; 

a step of searching for a noise transition between the new noise model and the 
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current noise model; and wherein, 

when the noise transition has been detected, the method comprises a step of 
updating the reference space as a function of the new noise model, the parameterizing step 
including a step of matching the parameters to the new noise model. 

Pastorfll teaches: 

a step of reiterative searching for successive noise models in the sequence of 
temporal frames, a new noise model replacing a current noise model, each noise model 
comprising several successive frames; (see Abstract) 

a step of searching for a noise transition between the new noise model and the 
current noise model; (Fig. 3: "comparison with energy of the previous model") 

Yamaquchi teaches: 

a step of updating (adapting) the reference space (initial noisy speech model) as a 
function of the new noise model (noisy speech model after condition change); (Fig 3; col. 2, 
II.20-28; col.4, II.47-67; col.5, 11.1-52) 

{The initial noisy speech model are adapted the new noisy speech model after a detection of 
a change in the energy spectrum. It would be obvious to match to the parameters to the new 
noisy speech model since the new noisy speech model represents the most current and 
updated model.} 

a step of matching the parameters to the new noise model. (Fig. 3: 11,12) 
{speech recognition 11 is performed using the adapted noisy speech model 12.} 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to modify the speech recognition method of Applicant's admitted prior art 
to include the reiterative and noise transition search of Pastor[1] and the step of updating a 
reference space as a function of a new noise model as taught by Yamaguchi in order to 
improve the robustness of speech recognition. The reiterative search method for noise 
reduction is useful in aircraft where noise from engines, air-conditioning, aerodynamics are 
continuously presented and where intelligent speech recognition in such noisy environment is 
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important (Pastor[1], col.1, 11.14-42). The updating of noise models is useful for improving 
recognition rate (Yamaguchi: col.1, 11.21-52). 


Claim(s) ^ 
2 


!jV l n The combination of Applicant's admitted prior art and PactorfU would show: 

A method of voice recognition according to claim 1 , wherein the step of searching for 
a noise transition comprises a step of searching for an energy incompatibility and/or a step of 
searcning ior a specirai incompaiiDiiny Deiween me new noise mooei ana tne current moaei, 
the detection of an incompatibility expressing a noise transition. (Pastor[1]: col.4, II. 55-67; 
col.5, 11.1-8) 

{spectral incompatibility search is met by mean energy comparison between the old and new 
noise model.} 


Claim(s)^ 
3 


) ( ^ The oombination of Applicant's admitted prior art -and PastorHl would show: 

A method of voice recognition according to claim 2, wherein the step of searching for 
an energy incompatibility comprises a comparison of the ratio between a mean energy 
Enewmod of the new noise model and a mean energy of the current noise model Emodcurr 
with a low threshold S' and a high threshold S, an energy incompatibility being found if the 
ratio is outside the interval delimited by the two thresholds S, S\ (Pastor[1]: col.5, II. 58-67; 
col.6, 11.1-6) 


Claim(s) ^ 
4 


Y' The combination of Applicant's admitted prior art and Pootorf H- would show: 

A method of voice recognition according to claim 3, wherein the step of searching for 
an energy incompatibility also comprises a comparison of the mean energy Enewmod of the 
new noise model and the mean energy of the current noise model Emodcurr with an energy 
floor threshold Emin below which the noise is negligible, the energy incompatibility 
determined by the comparison of the ratio between the mean energy of the new noise model 
Enewmod and the mean energy of the current noise model Emodcurr being ignored when the 
mean energy of the new noise model Enewmod and the mean energy of the current noise 
model Emodcurr are both below the energy floor threshold Emin. (Pastor[1]: Fig. 3: "Halt the 
formulation", col.6, II.25-34) 

{Emin can be a value < 1/S (S-1/S) in which case the comparison of ratio of mean energy is 
abandoned due to insignificant presence of ambient noise.} 
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Claim(s) 
13 


Applicant's admitted prior art shows: 

A method according to claim 1, using, as references, a sequence of temporal frames 
corresponding to one or more words, this sequence of frames being identified by a series of 
vectors of parameters, said parameters being obtained by compression of spectral 
coefficients. (Fig. 1b: 2, 3) 


Claim(s) 
15 


Applicant's admitted prior art shows: 

A method according to claim 1 , comprising a step of noise-suppression in the speech- 
containing temporal frames before the parameterizing step. (Fig. 1b: 6) 


Claim(s) 
16 


Applicant's admitted prior art shows: 

A system of voice recognition in a noise-ridden acoustic signal comprising: 

means to acquire the noise-ridden acoustic signal, digitize the noise-ridden acoustic 
signal and subdivide subdivide the noise-ridden acoustic signal into temporal frames; (Fig. 1b: 
1) 

a parameterizing chain to translate the temporal frames containing speech into 
vectors of parameters in the frequency domain; (Fig. 1b: 2) 

shape-recognition means with a reference space having references acquired during a 
learning stage, to compare the vectors of parameters coming from the parameterizing chain 
with the references, so as to obtain recognition by the determination of a reference that most 
closely approaches the vectors of parameters; (Fig. 1b: 3) 

Applicant's admitted prior art does not show: 

means for modeling the noise to reiteratively prepare noise models, a new noise 
model replacing a current noise model; 

means for detecting a noise transition between the new noise model and the current 
noise model; 
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means to match the parameterizing chain with the new noise model having activated 
the noise transition; and 

means to update the references of the reference space as a function of a noise level 
of the new noise model having activated the noise transition. 

PastorfU teaches: 

means for modeling the noise to reiteratively prepare noise models, a new noise 
model replacing a current noise model; and (see Abstract) 

means for detecting a noise transition between the new noise model and the current 
noise model. (Fig. 3: "comparison with energy of the previous model") 

Yamaquchi teaches: 

means to match the parameterizing chain with the new noise model having activated 
the noise transition; (Fig. 3: 11,12) and 

{speech recognition 11 is performed using the adapted noisy speech model 12.} 

means to update (adapting) the references of the reference space (initial noisy 
speech model) as a function of a noise level of the new noise model (noisy speech model 
after condition change) having activated the noise transition. (Fig 3; col.2, II.20-28; col.4, II.47- 
67; col.5, 11.1-52) 

(The initial noisy speech model are adapted the new noisy speech model after a detection of 
a change in the energy spectrum. It would be obvious to match to the parameters to the new 
noisy speech model since the new noisy speech model represents the most current and 
updated model.} 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to modify the speech recognition method of Applicant's admitted prior art 
to include the reiterative and noise transition search of Pastor[1] and the means of updating a 
reference space as a function of a new noise model as taught by Yamaguchi in order to 
improve the robustness of speech recognition. The reiterative search method for noise 
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reduction is useful in aircraft where noise from engines, air-conditioning, aerodynamics are 
continuously presented and where intelligent speech recognition in such noisy environment is 
important (Pastor[1], col.1, 11.14-42). The updating of noise models is useful for improving 
recognition rate (Yamaguchi: col.1, 11.21-52). 


Claim(s) 
17 


The combination of Applicant's admitted prior art, Pastorfll, and Yamaauchi would show: 


A system of voice recognition according to claim 16, wherein the means used to 
update the references of the reference space comprise a first memory space (Fig. 3: 1 1 ) to 
store the updated references, said updated references having to replace current references 
used for shape recognition before the detection of the noise transition, said current references 
being stored in a second memory space (Fig. 3: 6). (Yamaguchi: Fig. 3) 


Claim(s) 
19 


Applicant's admitted prior art shows: 

A voice-recognition system according to claim 16, comprising means for noise- 
suppression (Fig. 1b: 6) in the speech-containing temporal frames before the speech 
containing temporal frames are translated by said parameterizing chain (Fig. 1b: 2). 



7. Claim 7 is rejected under 35 U.S.C. 103(a) as being unpatentable over Applicant's admitted prior 
art in view of Pastor[1] and Yamaguchi, and further in view of Pastor et al. (U.S. Patent 6,445,801 , 
hereinafter u Pastor[2]"). 


Claim(s) 
7 


The modified Applicant's admitted prior art does not show: 

A method of voice recognition according to claim 1, wherein the parameterizing step 
comprises a step of determining spectral coefficients Bi.par, each associated with a frequency 
channel i each expressing a representation of a spectral energy of a frame containing speech 
in the frequency channel i, the parameter-matching step comprising a determining, for each 
spectral coefficient Bi.par, of a robustness operator OpRob(Bi,par), the robustness operator 
expressing the confidence to be attached to the spectral coefficient Bi.par with respect to the 
noise level of the new noise model in the same frequency channel i, a weighting of the 
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spectral coefficient Bi,par with the robustness operator OpRob(Bi.par) and a determining of 
the vector of parameters on the basis of the weighted spectral coefficients. 

Pastoral teaches: 

a parameterizing step comprises: 

a step of determining spectral coefficients Bi.par ("yj(v), y x (v)"), each associated with a 
frequency channel i (v) each expressing a representation of a spectral energy of a frame 
containing speech in the frequency channel I (Fig. 8, estimation of H yj(v), y x (v)"; col. 14, 11.17- 
20), the parameter-matching step comprising: 

a determining, for each spectral coefficient Bi.par ( l, yj(v), y x (v)"), of a 
robustness operator OpRob(Bi.par) (e.g., a), the robustness operator expressing the 
confidence (e.g., weighting of spectral coefficients on a: see eq.8) to be attached to the 
spectral coefficient Bi,par with respect to the noise level of the new noise model in the same 
frequency channel i, (col. 7, eq.8; col.8, 11.10-18) 

a weighting of the spectral coefficient Bi,par with the robustness operator 
OpRob(Bi.par) and (col.7, eq.8; col.7, II.66-67; col.8, 11.1-6; col.8, II.37) 

a determining of the vector of parameters on the basis of the weighted 
spectral coefficients. (Fig. 8, "computation of the Wiener coefficient"; col.7, eq.8) 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to include the spectral coefficient determination step of Pastor[2] in the 
modified Applicant's admitted prior art in order to perform noise suppression using Wiener 
filter. Wiener filter is a commonly used noise filter (Pastor[2]: col.1, II. 58-67; col.2, 1-5) 
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8. Claim 14 is rejected under 35 U.S.C. 103(a) as being unpatentable over Applicant's admitted prior 
art in view of Pastor[1] and Yamaguchi, and further in view of Takahashi et al. (U.S. Patent 5,793,891 , 
hereinafter "Takahashi"). 


Claim (s) 
14 


The modified Applicant's admitted prior art does not show: 

A method according to claim 1 , using, as references, a sequence of temporal frames 
corresponding to one or more phonemes, said sequence of frames being identified by a 
center and a standard deviation of a Gaussian function, said center and said standard 
deviation depending on the parameters of the vectors of parameters of the frames. 

Takahashi teaches: 

using, as references, a sequence of temporal frames corresponding to one or more 
phonemes, said sequence of frames being identified by a center (mean) and a standard 
deviation (variance) of a Gaussian function, said center and said standard deviation 
depending on the parameters of the vectors of parameters of the frames, (col. 14, 11.1 3-30; 
col.2, 11.31-47) 

It would have been obvious to a person of ordinary skill in the art at the time the 
invention was made to include in the modified Applicant's admitted prior art the 
representation of reference phonemes by Gaussian functions as taught by Takahashi in order 
model the noisy speech using Gaussian functions. The Gaussian function is commonly used 
to model additive noise and provides a good noise model to model noises in noisy speech. 



Allowable Subject Matter 
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9. Claims 5, 6, 8-12, and 18 are objected to as being dependent upon a rejected base claim, but 
would be allowable if rewritten in independent form including all of the limitations of the base claim and 
any intervening claims. 

10. The following is a statement of reasons for the indication of allowable subject matter: 


Claim(s) 
5 


The prior art fails to show: 

A method of voice recognition according to claim 2, wherein the step of searching for 
spectral incompatibility comprises, on the basis of spectral coefficients Bi.modcurr Bi.newmod 
respectively expressing a spectral energy of frames of the current noise model and a spectral 
energy of frames of the new noise model in at least one frequency channel i, a comparison of 
the ratio between the spectral coefficient Bi,newmod associated with the frequency channel i 
of the new noise model and the spectral coefficient Bi.modcurr associated with the same 
frequency channel i of the current noise model with a low threshold Sf and a high threshold 
Sf, a spectral incompatibility being found if the ratio is located outside the interval delimited by 
the two thresholds, Sf, Sf. 


Claim(s) 
6 


The Drior art fails to show: 

A method of voice recognition according to claim 5, wherein the step of searching for 
a spectral incompatibility also comprises, for at least one frequency channel i, a comparison 
of the spectral coefficient Bi.newmod of the new noise model in the frequency channel i and 
of the spectral coefficient Bi.modcurr of the current noise model in the frequency channel i 
with a floor spectral coefficient Bi.min associated with the frequency channel i, namely a floor 
below which the noise is negligible, a spectral incompatibility determined by the comparison 
of the ratio between spectral coefficients being ignored when, for the frequency channel i, the 
spectral coefficients of the new noise model and of the current noise model are both below 
the floor spectral coefficient Bi.min. 


Claim(s) 
8 


The prior art fails to show: 

A method according to claim 7, wherein the robustness operator OpRob(Bi.par) 
verifies the following relationship: 
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OpRob{B, par ) = < 



max 



0.25 + 



B. -P(B. 

i, par V i, 



new mod 



V 



^i,par ^^(^i, new mod ) 



,o 



Bi,par being the spectral coefficient and P(Bi,newmod) being a parameter dependent on the 
noise level of the new noise model having activated the transition in the frequency channel i. 



Claim(s) 
9 



The prior art fails to show: 

A method of voice recognition according to claim 1, further comprising reference 
space updating step comprising the following operations, on the basis of basic spectral 
coefficients each associated with a frequency channel i, each expressing a spectral energy of 
a basic frame obtained during the learning step: 



the determining of a robustness operator OpRob(Bi,base) for each basic spectral 
coefficient Bi.base, the robustness operator expressing the confidence to be attached to the 
basic spectral coefficient Bi,base with respect to a noise level of the new noise model in the 
same frequency channel i, a weighting of the basic spectral coefficients Bi.base with the 
respective robustness operators OpRob(Bi,base), and preparation of the updated references 
with the weighted basic spectral coefficients. 



Claim(s) 
10 



The prior art fails to show: 

A method according to claim 9, wherein the robustness operator OpRob(Bi.base) for 
the updating of the reference space verifies the following relationship: 



OpRob(B iMse ) 



maxf 0.25 + P( ^-" e "™ d) # 



^ ^i,base ^^(^newmod ) ) j 



Bi.base being the basic spectral coefficient and P(Bi,newmod) being a parameter depending 
on the noise level of the new noise model having activated the transition, in the frequency 
channel i. 
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Claim(s) 
11 


The Drior art fails to show: 

A method according to claim 9, in which the references are prepared on the basis of 
compressed basic spectral coefficients, wherein the method uses a conversion table to 
convert the compressed basic spectral coefficients into compressed and weighted basic 
spectral coefficients. 


Claim(s) 
12 


The prior art fails to show: 

A method according to claim 1 1, wherein the conversion table contains the non- 
compressed basic spectral coefficients Bi.base obtained by application of the reverse of the 
compression function to the compressed basic coefficients and wherein the method 
comprises: 

determining the robustness operator OpRob(Bi,base) for each of the non-compressed 
basic spectral coefficients Bi.base, a weighting of the non-compressed basic spectral 
coefficients Bi.base, a compression of the non-compressed and weighted basic spectral 
coefficients so as to obtain the compressed and weighted basic spectral coefficients. 


Claim(s) 
18 


The prior art fails to show: 

A voice-recognition system according to claim 16, comprising a memory space to 
store compressed basic spectral coefficients obtained from basic spectral coefficients each 
associated with a frequency channel i, said basic spectral coefficients each expressing the 
spectral energy of a basic frame coming from the learning stage, a conversion table to 
convert the compressed basic spectral coefficients into compressed and weighted basic 
spectral coefficients, each weighted by a robustness operator OpRob(Bi,base) as a function 
of the noise level of the new noise model having activated the noise transition and of the 
basic spectral coefficient (Bi.base) to be weighted, said compressed and weighted basic 
spectral coefficients being used for the updating of the references of the reference space. 
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Conclusion 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Tim Lao whose telephone number is 703-305-8955. 
The examiner can normally be reached on M-F, 8:30am-5pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Doris To can be reached on 703-305-4827. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 



Tim Lao 
Examiner 
Art Unit 2655 
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